Hits 1 – 20 of 23

1	Vocapia-LIMSI System for 2020 Shared Task on Code-switched Spoken Language Identification
	Barras, Claude; Le, Viet-Bac; Gauvain, Jean-Luc
	In: The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities ; https://hal.archives-ouvertes.fr/hal-03091792 ; The First Workshop on Speech Technologies for Code-Switching in Multilingual Communities, Oct 2020, Shanghai, China (2020)
	BASE

2	Challenges in Audio Processing of Terrorist-Related Data
	Gauvain, Jodie; Lamel, Lori; Le, Viet Bac...
	In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02415176 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
	BASE

3	Challenges in Audio Processing of Terrorist-Related Data
	Gauvain, Jodie; Lamel, Lori; Le, Viet Bac...
	In: International Conference on Multimedia Modeling ; https://hal.archives-ouvertes.fr/hal-02387373 ; International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece (2019)
	BASE

4	Language Recognition for Dialects and Closely Related Languages
	Gelly, Grégory; Gauvain, Jean-Luc; Lamel, Lori...
	In: Odyssey 2016 ; https://hal.archives-ouvertes.fr/hal-01744188 ; Odyssey 2016, Jun 2016, Bilbao, Spain (2016)
	BASE

5	Improving Data Selection for Low Resource STT and KWS
	Fraga-Silva, Thiago; Laurent, Antoine; Gauvain, Jean-Luc. - 2016
	BASE

6	Lexical speaker identification in TV shows
	Roy, Anindya; Bredin, Hervé; Hartmann, William...
	In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-01690342 ; Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377 - 1396. ⟨10.1007/s11042-014-1940-3⟩ (2015)
	BASE

7	Traduction de la parole dans le projet RAPMAT
	Maynard, Hélène; Segal, Natalia; Bilinski, Eric...
	In: Journées d'Études sur la Parole ; https://hal.archives-ouvertes.fr/hal-01843418 ; Journées d'Études sur la Parole, Jan 2014, Le Mans, France (2014)
	BASE

8	Comparing decoding strategies for subword-based keyword spotting in low-resourced languages
	Hartmann, William; Le, Viet Bac; Messaoudi, Abdelkhalek...
	In: Annual Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-01843408 ; Annual Conference of the International Speech Communication Association, ISCA, Sep 2014, Singapore, Singapore (2014)
	BASE

9	Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification
	Sarkar, Achintya; Do, Cong-Thanh; Le, Viet-Bac; Barras, Claude
	In: ISSN: 1070-9908 ; IEEE Signal Processing Letters ; https://hal.archives-ouvertes.fr/hal-01690336 ; IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2014, 21 (9), pp.1040 - 1044. ⟨10.1109/LSP.2014.2323432⟩ (2014)
	Abstract: Most speaker recognition systems rely on short-term acoustic cepstral features to extract speaker-relevant information from the signal. However, phonetically discriminative features, extracted by a bottleneck multi-layer perceptron (MLP) over longer stretches of time, can provide complementary information and have been adopted in speech transcription systems. We compare speaker verification performance using cepstral features, discriminative features, and a concatenation of both followed by dimension reduction. We consider two speaker recognition systems, one based on maximum likelihood linear regression (MLLR) super-vectors and the other a state-of-the-art i-vector system with two session variability compensation schemes. Experiments are reported on a standard configuration of the NIST SRE 2008 and 2010 databases. The results show that the phonetically discriminative MLP features retain speaker-specific information that is complementary to the short-term cepstral features. The performance improvement is obtained with both score-domain and feature-domain fusion, and the speaker verification equal error rate (EER) is reduced by up to 50% relative compared to the best i-vector system using only cepstral features.
	Keyword: [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing; [INFO]Computer Science [cs]; bottleneck features; i-vector; LDA; multi-layer perceptron; PCA; PLDA; Speaker verification
	URL: https://doi.org/10.1109/LSP.2014.2323432 https://hal.archives-ouvertes.fr/hal-01690336/document https://hal.archives-ouvertes.fr/hal-01690336/file/double-final.pdf https://hal.archives-ouvertes.fr/hal-01690336
	BASE

10	Lattice MLLR based m-vector system for speaker verification
	Sarkar, Achintya Kumar; Barras, Claude; Le, Viet Bac
	In: IEEE International Conference on Acoustics, Speech, and Signal Processing ; https://hal.archives-ouvertes.fr/hal-01836461 ; IEEE International Conference on Acoustics, Speech, and Signal Processing, Jan 2013, Vancouver, Canada (2013)
	BASE

11	Unsupervised Speaker Identification using Overlaid Texts in TV Broadcast
	Poignant, Johann; Bredin, Hervé; Le, Viet-Bac...
	In: Proceedings of the 13th Annual Conference of the International Speech Communication Association (Interspeech) ; Interspeech 2012 - Conference of the International Speech Communication Association ; https://hal.archives-ouvertes.fr/hal-00767427 ; Interspeech 2012 - Conference of the International Speech Communication Association, Sep 2012, Portland, OR, United States. 4p (2012)
	BASE

12	Recherche par le contenu dans des documents audiovisuels multilingues
	Quénot, Georges; Tan, Tien-Ping; Le, Viet-Bac...
	In: ISSN: 1279-5127 ; EISSN: 1963-1014 ; Document Numérique ; https://hal.inria.fr/hal-00953796 ; Document Numérique, Lavoisier, 2010, 13 (1), pp.229-246 (2010)
	BASE

13	Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet
	Quénot, Georges; Tan, Tien-Ping; Le, Viet-Bac...
	In: ISSN: 1380-7501 ; EISSN: 1573-7721 ; Multimedia Tools and Applications ; https://hal.archives-ouvertes.fr/hal-00953696 ; Multimedia Tools and Applications, Springer Verlag, 2010, 48 (1), pp.123-140. ⟨10.1007/s11042-009-0377-6⟩ (2010)
	BASE

14	Automatic speech recognition for under-resourced languages: application to Vietnamese language
	Besacier, Laurent; Le, Viet-Bac
	In: Institute of Electrical and Electronics Engineers. IEEE transactions on audio, speech and language processing. - New York, NY : Inst. 17 (2009) 8, 1471-1482
	BLLDB
	OLC Linguistik

15	Exploitation d'un corpus bilingue comparable pour la création d'un système de traduction probabiliste Vietnamien - Français
	Do, Thi Ngoc Diep; Le, Viet-Bac; Bigi, Brigitte...
	In: TALN ; TALN 2009, Senlis, 24-26 juin 2009 ; https://hal.archives-ouvertes.fr/hal-00959202 ; TALN 2009, Senlis, 24-26 juin 2009, 2009, Unknown, pp.x-x (2009)
	BASE

16	Mining a comparable text corpus for a Vietnamese - French statistical machine translation system
	Do, Thi-Ngoc-Diep; Le, Viet-Bac; Bigi, Brigitte...
	In: Fourth Workshop on Statistical Machine Translation ; https://hal.archives-ouvertes.fr/hal-01393602 ; Fourth Workshop on Statistical Machine Translation, 2009, Athens, Greece. pp.165 - 172, ⟨10.3115/1626431.1626466⟩ ; http://www.statmt.org/wmt09/ (2009)
	BASE

17	Recherche par le contenu dans des documents audiovisuels multilingues
	Quénot, Georges; Tan, Tien-Ping; Le, Viet-Bac...
	In: Actes de la conférence CORIA ; https://hal.inria.fr/hal-00954025 ; Actes de la conférence CORIA, 2009, Giens, France. pp.67-82 (2009)
	BASE

18	Content-Based Search in Multilingual Audiovisual Documents using the International Phonetic Alphabet
	Quénot, Georges; Tan, Tien-Ping; Le, Viet-Bac...
	In: 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009) ; https://hal.inria.fr/hal-00953855 ; 7th International Workshop on Content-Based Multimedia Indexing (CBMI 2009), 2009, Chania, Crete. 3-5 June 2009 (2009)
	BASE

19	Normalisation et alignement de corpus français et vietnamiens : Format et Logiciels
	Bigi, Brigitte; Le, Viet-Bac
	In: Actes JATD 2008 ; journées internationales d'analyse statistique des données textuelles ; https://hal.archives-ouvertes.fr/hal-01705630 ; journées internationales d'analyse statistique des données textuelles, Jun 2008, Lyon, France (2008)
	BASE

20	Acoustic-Phonetic Unit Similarities for Context Dependent Acoustic Model Portability
	Le, Viet-Bac; Besacier, Laurent; Schultz, Tanja. - 2008
	BASE
